Label-Free Network Pruning via Reinforcement Learning

LIU Huidong1, DU Fang1,2, YU Zhenhua1,2, SONG Lijuan1,2

1. School of Information Engineering, Ningxia University, Yinchuan 750021
2. Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, Yinchuan 750021
Abstract To remove redundant structures from deep neural networks and find a network structure that balances capability and complexity well, a label-free global compression learning method (LFGCL) is proposed. A global pruning strategy is learned from a representation of the network architecture, effectively avoiding the suboptimal compression rates that arise from pruning the network layer by layer. LFGCL is independent of data labels during pruning: the compressed network is optimized to output features similar to those of the baseline network. The deep deterministic policy gradient algorithm is applied to explore the optimal network structure by inferring the compression ratios of all layers via reinforcement learning. Experiments on multiple datasets show that LFGCL achieves better performance.
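As a minimal sketch of the two ideas in the abstract, the PyTorch-style code below is an illustrative assumption, not the paper's implementation: a DDPG-style actor maps a network-state embedding to one compression ratio per layer (a global strategy, rather than layer-by-layer decisions), and a frozen baseline network supervises the pruned network through a label-free feature-matching loss. All names (Actor, distill_step, state_dim) are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# DDPG-style actor (hypothetical): maps an embedding of the current network
# state to a compression ratio per layer, all in (0, 1). One forward pass
# yields a global pruning strategy for the whole network at once.
class Actor(nn.Module):
    def __init__(self, state_dim: int, num_layers: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(),
            nn.Linear(64, num_layers), nn.Sigmoid())

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)  # compression ratio for every layer

# Label-free objective: the pruned network is trained to reproduce the
# frozen baseline's output features, so no ground-truth labels are needed.
def distill_step(pruned: nn.Module, baseline: nn.Module,
                 x: torch.Tensor, opt: torch.optim.Optimizer) -> float:
    with torch.no_grad():            # the baseline supplies the targets
        target = baseline(x)
    loss = F.mse_loss(pruned(x), target)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```

In a full DDPG loop, the reward would presumably trade feature similarity against the resulting model complexity, steering the actor toward structures that balance capability and cost.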
Received: 23 September 2020
Fund: National Natural Science Foundation of China (No.61901238), Natural Science Foundation of Ningxia Province (No.2018AAC03020, 2018AAC03025)
Corresponding Author:
YU Zhenhua, Ph.D., lecturer. His research interests include machine learning, computer vision and bioinformatics.
About the authors: LIU Huidong, master student. His research interests include network compression and reinforcement learning. DU Fang, Ph.D., professor. Her research interests include big data management and artificial intelligence. SONG Lijuan, Ph.D., associate professor. Her research interests include image processing and computer vision.